Selectional Restrictions in HPSG* 
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Abstract 

Selectional restrictions are semantic sortal con- 
straints imposed on the participants of lin- 
guistic constructions to capture contextually- 
dependent constraints on interpretation. De- 
spite their limitations, selectional restrictions 
have proven very useful in natural language ap- 
plications, where they have been used frequently 
in word sense disambiguation, syntactic disam- 
biguation, and anaphora resolution. Given their 
practical value, we explore two methods to in- 
corporate selectional restrictions in the hpsg 
theory, assuming that the reader is familiar with 
HPSG. The first method employs hpsg's back- 
ground feature and a constraint-satisfaction 
component pipe-lined after the parser. The 
second method uses subsorts of referential in- 
dices, and blocks readings that violate selec- 
tional restrictions during parsing. While the- 
oretically less satisfactory, we have found the 
second method particularly useful in the devel- 
opment of practical systems. 

1 Introduction 

The term selectional restrictions refers to se- 
mantic sortal constraints imposed on the par- 
ticipants of linguistic constructions. Selectional 
restrictions are invoked, for example, to account 
for the oddity of @ and (D (cf. (|) and (|)). 

(1) ?Tom ate a keyboard. 

(2) Tom ate a banana. 

(3) ?Tom repaired the technician. 

(4) Tom repaired the keyboard. 
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To account for (|T]) and (^), one would typically 
introduce a constraint requiring the object of 
"to eat" to denote an edible entity. The odd- 
ity of (2) can then be attributed to a violation 
of this constraint, since keyboards are typically 
not edible. Similarly, in (^) and (^) one could 
postulate that "to repair" can only be used with 
objects denoting artifacts. This constraint is vi- 
olated by dH), because technicians are typically 
persons, and persons are not artifacts. 

We note that selectional restrictions attempt 
to capture contextually-dependent constraints 
on interpretation. There is nothing inherently 
wrong with (|^), and one can think of special 
contexts (e.g. where Tom is a circus performer 
whose act includes gnawing on computer pe- 
ripherals) where @) is felicitous. The oddity 
of ([l|) is due to the fact that in most contexts 
people do not eat keyboards. Similarly, @ is 
felicitous in a science-fiction context where the 
technician is a robot, but not in most usual con- 
texts. Selectional restrictions are typically used 
to capture facts about the world which are gen- 
erally, but not necessarily, true. 

In various forms, selectional restrictions have 
been used for many years, and their limitations 
are well-known (Allen, 1995). For example, 
they cannot account for metaphoric uses of lan- 
guage (e.g. (^)), and they run into problems in 
negated sentences (e.g. unlike (|l]), there is noth- 
ing odd about @). 



(5) My car drinks gasoline. 

(6) Tom cannot eat a keyboard. 

Despite their limitations, selectional restric- 
tions have proven very useful in practical apph 
cations, and they have been employed in sev- 
eral large-scale natural language understanding 
systems ([Martin et al, 1986| ) (|Alshawi, 1992|) 



Apart from blocking pragmatically ill-formed 
sentences like and selectional restric- 
tions can also be used in word sense disambigua- 
tion, syntactic disambiguation, and anaphora 
resolution. In (0), for example, the "printer" 
refers to a computer peripheral, while in (^) it 
refers to a person. The correct sense of "printer" 
can be chosen in each case by requiring the ob- 
ject of "to repair" to denote an artifact, and the 
subject of "to call" (when referring to a phone 
call) to denote a person. 

(7) Tom repaired the printer. 

(8) The printer called this morning. 

Similarly, is from a syntactic point of view 
potentially ambiguous: the relative clause may 
refer either to the departments or the employ- 
ees. The correct reading can be chosen by speci- 
fying that the subject of "retire" (the relativised 
nominal in this case) must denote a person. 

(9) List the employees of the overseas depart- 
ments that will retire next year. 

Given the value of selectional restrictions in 
practical applications, we explore how they can 



be utilised in the HPSG theory ( Pollard and 



Sag, 1994), assuming that the reader is familiar 
with HPSG. Our proposals are based on expe- 
rience obtained from using HPSG in a natural 
language database interface ( Androutsopouloj 
et al., 1998| ) and a dialogue system for a mobile 
robot. To the best of our knowledge, selectional 
restrictions have not been explored so far in the 
context of HPSG. 

We note that, although they often exploit 
similar techniques (e.g. semantic sort hierar- 
chies), selectional restrictions costitute a differ- 



ent topic from linking theories (Davis, 1996). 
Roughly speaking, linking theories explore the 
relation between thematic roles (e.g. agent, pa- 
tient) and grammatical functions (e.g. subject, 
complement), while selectional restrictions at- 
tempt to account for the types of world entities 
that can fill the thematic roles. 

We discuss in sections 2 and 3 two ways that 
we have considered to incorporate selectional 
restrictions into HPSG. Section ^ concludes by 
comparing briefly the two approaches. 



2 Background restrictions 

The first way to accommodate selec- 
tional restrictions in HPSG uses the 
CONTEXT I BACKGROUND (abbreviated here 
as Cx|bg) feature, which Pollard and Sag 



( Pollard and Sag, 1994 ) reserve for "felicity 
conditions on the utterance context", "presup- 
positions or conventional implicatures" , and 
"appropriateness conditions" {op cit pp. 27, 
332). To express selectional restrictions, we add 
qfpsoas (quantifier-free parameterised states 
of affairs) with a single semantic role (slot) 
in cx|bg.|^ For example, apart from the eat 
qfpsoa in its nucleus (nuc), the lexical sign 
for "ate" (shown in (|lO[)) would introduce an 
edible qfpsoa in BG, requiring [D (the entity 
denoted by the object of "ate") to be edible. 



(10) 



PHON {ate) 



SS LOG 



GAT 

GONT I NUG 
eat 

GX I BG I 



HEAD verb 
SUBJ ( NP[Y] ) 
GOMES ( NP[2] ) 
EATER [II 
EATEN [U 



edible 



INST [H 



In the case of lexical signs for proper names 



(e.g. (11)), the treatment of Pollard and Sag 
inserts a naming {namg) qfpsoa in BG, which 
requires the bearer (brer) to by identifiable 
in the context by means of the proper name. 
(1Ti| ) also requires the bearer to be a man. 



:ii) 



PHON ( Tom) 

CAT 



SS LOG 



GONT 



GX I BG 



HEAD noun^ 

INDEX H] 
RESTR {} 

BRER \B 



namg 
. man ^ 



NAME Tom 
INST nil 



The HPSG principles that control the propa- 
gation of the BG feature are not fully developed. 
For our purposes, however, the simplistic prin- 
ciple of contextual consistency of Pollard and 
Sag will suffice. This principle causes the BG 



^To save space, we use qfpsoas wherever Pollard and 
Sag use quantified ps oas. We also ignore tense and as- 
pect here. Consult ( Androutsopoulos et al., 1998 ) for 
the treatment of tense and aspect in our HPSG-based 
database interface. 



value of each phrase to be the union of the BG 
values of its daughters. Assuming that the lex- 
ical sign of "keyboard" is (|l|), cause 
(|l]) to receive ( p^ , that requires [2] to denote an 
edible keyboard. 



entity 



(12) 



PHON (keyboard) 

CAT HEAD noMnj 

INDEX [U 
_CX I BG {} 



SS LOG 



(13) 



PHON 



SS I LOG 



Tom, ate, a, keyboarc 
HEAD verb 
CAT SUBJ ( ) 

COMPS ( ) 
QUANTS ( 



CONT 



CX EG < 



NUG 



namg 
man 
edible 



[iNST \2}])' 
keybd ^ J' 

EATER [II 

EATEN [U 



eat 
BRER H] 
NAME Tom 



[iNST 
[iNST [H] 



According to (|^), to accept (|T]), one has to 
place it in a special context where edible key- 
boards exist (e.g. (||) is felicitous if it refers to a 
miniature chocolate keyboard). Such contexts, 
however, are rare, and hence (0) sounds gen- 
erally odd. Alternatively, one has to relax the 
BG constraint that the keyboard must be edible. 
We assume that special contexts allow particu- 
lar BG constraints to be relaxed (this is how we 
would account for the use of (||) in a circus con- 
text), but we do not have any formal mechanism 
to specify exactly when BG constraints can be 
relaxed. 

Similar comments apply to ( |^). Assuming 
that the sign of "repaired" is (M), and that 
the sign of "technician" is similar to (^) ex- 
cept that it introduces a technician index, (|3|) 
receives a sign that requires the repairer to be a 
technician who is an artifact. Technicians, how- 
ever, are generally not artifacts, which accounts 
for the oddity of 




non arfct artifact 



man technician ■■■ \J \J ■■■ keybd 

edible animal edible non afct 



male_lech 



banana 



Figure 1: A simplistic semantic hierarchy 



(14) 



PHON 



SS LOG 



CAT 



GONT NUC 



GX BG 



HEAD verb 

SUBJ ( NP[Y] ) 

GOMPS ( NP[2] ) 

REPAIRER H] 

REPAIRED [2] 
repair '- 

/ [iNST M\ 

l^artifact 



Let us now consider how a computer system 
could account for (|l])-(Q). For example, how 
would the system figure out from dH) that @ 
is pragmatically odd? Among other things, it 
would need to know that keyboards are not edi- 
ble. Similarly, in (^) it would need to know that 
bananas are edible, and in (|3|)-(|^ it would need 
to be aware that technicians are not artifacts, 
while keyboards are. Systems that employ se- 
lectional restrictions usually encode knowledge 
of this kind in the form of sort hierarchies of 
world entities. A simplistic example of such a 
hierarchy is depicted in figure |l[ The hierarchy 
of figure |l] shows that all men and technicians 
are persons, all persons are animate entities, all 
animate entities are physical objects, and so on. 
Some (but not all) persons are both technicians 
and men at the same time; these persons are 
members of the male-tech sort. Similarly, all 
bananas are edible and not artifacts. No person 
is edible, because the sorts person and edihle 
have no common subsorts. 

It is, of course, extremely difficult to con- 
struct hierarchies that include all the sorts of 
world entities. In natural language systems that 
target specific and restricted domains, however. 



constructing such hierarchies is feasible, because 
the relevant entity sorts and the possible hi- 
erarchical relations between them are limited. 
In natural language database interfaces, for ex- 
ample, the relevant entity sorts and the rela- 
tions between them are often identified dur- 
ing the desing of the database, in the form 
of entity-relationship diagrams. We also note 
that large-scale semantic sort hierarchies are al- 
ready in use in artificial intelligence and natural 
language generation projects (for example, Cyc 
([Lenat, 1995| ) and kpml's Upper Model (|Bate-| 
man, 1997] )), and that the techniques that we 



discuss in this paper are in principle compatible 
with these hierarchies. 

To decide whether or not a sentence violates 
any selectional restrictions, we collect from the 
CONT and BG features of its sign ((13) in the 
case of (^) all the single-role qfpsoas for which 
there is a sort in the hierarchy with the same 
name. (This rules out single-slot qfpsoas in- 
troduced by the CONTs of intransitive verbs.) 
The decision can then be seen as a constraint- 
satisfaction problem, with the collected qfpsoas 
acting as constraints. (^) shows the constraints 
for (0), rewritten in a form closer to predicate 
logic. HPSG indices (the boxed numbers) are 
used as variables. 

(15) keybd(0} A man([T]) A edible{\2\) 

Given two contstraints ci , C2 on the same vari- 
able, ci subsumes ci if the corresponding hier- 
archy sort of ci is an ancestor of that of C2 or 
if ci = C2. ci and C2 can be replaced by a new 
single constraint c, if c\ and C2 subsume c, and 
there is no other constraint c' which is subsumed 
by ci , C2 and subsumes c. c and d must be con- 
straints on the same variable as ci , C2 , and must 
each correspond to a sort of the hierarchy. If the 
constraints of a sentence can be turned in this 
way into a form where there is only one con- 
straint for each variable, then (and only then) 
the sentence violates no selectional restrictions. 

In (p!5|), keybdi^ and edihlei^ cannot be re- 
placed by a single constraint, because ktybd and 
edible have no common subsorts. Hence, a se- 
lectional restriction is violated, which accounts 
for the oddity of ([l|). In contrast, in (^) the 
constraints would be as in (0). 



6anana([2]) and edible(j2l) can now be re- 
placed by banana{\2\), because both subsume 
banana(0), and no other constraint subsumed 
by both banana{\2l) and edible{\2i) subsumes 
banana{\2i)- This leads to (|T7|) and the conclu- 
sion that (^) does not violate any selectional 
restrictions. 

(17) banana (0) A man(Jl\) 

This constraint-satisfaction reasoning, how- 
ever, requires a separate inferencing component 
that would be pipe-lined after the parser to rule 
out signs corresponding to sentences (or read- 
ings) that violate selectional restrictions. In the 
next section, we discuss an alternative approach 
that allows hierarchies of world entities to be 
represented using the existing hpsg framework, 
and to be exploited during parsing without an 
additional inferencing component. 

3 Index subsorts 

HPSG has already a hierarchy of feature struc- 



ture sorts (Pollard and Sag, 1994). This hierar- 
chy can be augmented to include a new part 
that encodes information about the types of 
entities that exist in the world. This can be 
achieved by partitioning the ref hpsg sort (cur- 
rently, a leaf node of the hierarchy of feature 
structures that contains all indices that refer 
to world entities) into subsorts that correspond 
to entity types. To encode the information of 
figure H, ref would have the subsorts abstract 
and physical, physical would have the subsorts 
animate, edible, inanimate, and so on. That 
is, referential indices are partitioned into sorts, 
and the indices of each sort can only be an- 
chored to world entities of the corresponding 
type (e.g. keybd indices can only be anchored 
to keyboards). 

With this arrangement, the lexical sign for 
"ate" becomes (p!^). The BG edible restriction 
of (|10|) has been replaced by the restriction that 
the index of the object must be of sort edible. 



(18) 



■pHON {ate) 



SS I LOG 



CAT 



CONT I NUC 
eat 



HEAD verb 
SUBJ ( NP[Y] ) 
COMES ( NP[2] ) 
EATER [T| 

EATEN \2}edible 



(16) banana{\2\) A man{\I]) A edible{\2\) 



Similarly, the sign for "Tom" becomes (|T^) (cf. 



([llD), and the sign for "keyboard" introduces an 
index of sort keybd as shown in ([20|) (cf. (12)). 



qfpsoa 



(19) [PHON (Tom) 



SS LOG 



CAT 
CONT 

CX I BG 



HEAD noun^ 
INDEX [Ilrrean 
RESTR {} 

BRER m 
NAME Tom 

namg 



(20) 



SS LOG 



CAT 
CONT 



PHON {keyboard) 

HEAD noiin] 
INDEX l2]keybd 
RESTR {} 
CX 1 BG {} 

Unification of indices proceeds in the same 
manner as unification of all other typed feature 
structures ( [Carpenter, 1992| ). The parsing of 
dl]) now fails, because it attempts to unify an 
index of sort edible (introduced by (ff^) with 
an index of sort keybd (introduced by (|20D), and 
no HPSG sort is subsumed by both. In contrast, 
the parsing of (Q) would succeed, because the 
sign of "banana" would introduce an index of 
sort banana, which is a subsort of edible (figure 
11); hence the two indices can be unified. (^) and 
(Q) would be processed similarly. 

In (^) and (|8|), there would be two lexical 
signs for "printer": one introducing an index of 
sort printer _per son, and one inroducing an in- 
dex of sort printer -peripheral . [printer .person 
and printer .peripheral would be daughters of 
person and artifact respectively in figure |l|.) 
The sign for "repaired" , would require the index 
of its object to be of sort artifact, and the sign 
of "called" would require its subject index to be 
of sort person. This correctly admits only the 
reading where the repaired entity is a computer 
peripheral, and the caller is a person. Similar 
mechanisms can be used to determine the cor- 
rect reading of (|9|). 

With the approach of this section, it is also 
possible to specify selectional restrictions in the 
declarations of qfpsoas in the hpsg hierarchy of 
feature structures, as shown in figure ^ rather 
than in the lexicon.^ When the same qfpsoa is 
used in several lexical signs, this saves having to 



^Additional layers can be included between afvsoa 
and the leaf so rts, as sketched in section 8.5 of ( Pollard 



EATER animate 
EATEN edible 



REPAIRER person 
REPAIRED artifact 



and Sag, 1994), to group together qfpsoas with common 



semantic roles. 



eat'- repair 

Figure 2: Declarations of qfpsoas 



repeat the same selectional restrictions in each 
one of the lexical signs. For example, the verbs 
"repair" and "&x" may both introduce a repair 
qfpsoa. The restriction that the repaired entity 
must be an artifact can be specified once in the 
declaration of repair in the hierarchy of feature 
structures, rather than twice in the lexical signs 
of "repair" and "fix". 

4 Conclusions 

We have presented two methods to incorpo- 
rate selectional restrictions in hpsg: (i) express- 
ing selectional restrictions as background con- 
straints, and (ii) employing subsorts of referen- 
tial indices. The first method has the advantage 
that it requires no modification of the current 
HPSG feature structures. It also maintains Pol- 
lard and Sag's distinction between "literal" and 
"non-literal" meaning (expressed by CONT and 
BACKGROUND respectively), a distinction which 
is blurred in the second approach (e.g. nothing 
in ( [l8|) shows that requiring the object to denote 
an edible entity is part of the non-literal mean- 
ing; cf. (0)). Unlike the first method, however, 
the second approach requires no additional in- 
ferencing component for determining when se- 
lectional restrictions have been violated. With 
sentences that contain several potentially am- 
biguous words or phrases, the second approach 
is also more efficient, as it blocks signs that 
violate selectional restrictions during parsing. 
In the first approach, these signs remain un- 
detected during parsing, and they may have a 
multiplicative effect, leading to a large number 
of parses, which then have to be checked individ- 
ually by the infer encing component. We have 
found the second approach particularly useful 
in the development of practical systems. 

There is a deeper question here about the 
proper place to maintain the kind of informa- 
tion encoded in selectional restrictions. The 
applicability of selectional restrictions is always 
context-dependent; and for any selectional re- 



striction, we can almost always find a context 
where it does not hold. Our second method 
above effectively admits that we cannot develop 
a general purpose solution to the problem of 
meaning interpretation, and that we have to ac- 
cept that our systems always operate in specific 
contexts. By committing to a particular con- 
text of interpretation, we 'compile into' what 
was traditionally thought of as literal meaning a 
set of contextually-determined constraints, and 
thus enable these constraints to assist in the 
HPSG language analysis without requiring an 
additional reasoning component. We take the 
view here that this latter approach is very ap- 
propriate in the construction of real applications 
which are, and are likely to be for the foresee- 
able future, restricted to operating in limited 
domains. 
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